Search CORE

236 research outputs found

Understanding the language of gene regulation

Author: Alkema Wynand
Wasserman Wyeth W
Publication venue: BioMed Central
Publication date: 01/01/2003
Field of study

A report on the Cold Spring Harbor Laboratory meeting 'Systems Biology: genomic approaches to transcriptional regulation', Cold Spring Harbor, USA, 6-9 March 2003

Hanze UAS repository

PubMed Central

Identification of cis-regulatory sequence variations in individual genome sequences

Author: Bernard Virginie
Wasserman Wyeth W
Worsley-Hunt Rebecca
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Functional contributions of cis-regulatory sequence variations to human genetic disease are numerous. For instance, disrupting variations in a HNF4A transcription factor binding site upstream of the Factor IX gene contributes causally to hemophilia B Leyden. Although clinical genome sequence analysis currently focuses on the identification of protein-altering variation, the impact of cis-regulatory mutations can be similarly strong. New technologies are now enabling genome sequencing beyond exomes, revealing variation across the non-coding 98% of the genome responsible for developmental and physiological patterns of gene activity. The capacity to identify causal regulatory mutations is improving, but predicting functional changes in regulatory DNA sequences remains a great challenge. Here we explore the existing methods and software for prediction of functional variation situated in the cis-regulatory sequences governing gene transcription and RNA processing

Crossref

PubMed Central

Identification of conserved regulatory elements by comparative genome analysis

Author: Engström Pär
Jareborg Niclas
Lenhard Boris
Mendoza Luis
Sandelin Albin
Wasserman Wyeth W
Publication venue: BioMed Central
Publication date: 01/01/2003
Field of study

BACKGROUND: For genes that have been successfully delineated within the human genome sequence, most regulatory sequences remain to be elucidated. The annotation and interpretation process requires additional data resources and significant improvements in computational methods for the detection of regulatory regions. One approach of growing popularity is based on the preferential conservation of functional sequences over the course of evolution by selective pressure, termed 'phylogenetic footprinting'. Mutations are more likely to be disruptive if they appear in functional sites, resulting in a measurable difference in evolution rates between functional and non-functional genomic segments. RESULTS: We have devised a flexible suite of methods for the identification and visualization of conserved transcription-factor-binding sites. The system reports those putative transcription-factor-binding sites that are both situated in conserved regions and located as pairs of sites in equivalent positions in alignments between two orthologous sequences. An underlying collection of metazoan transcription-factor-binding profiles was assembled to facilitate the study. This approach results in a significant improvement in the detection of transcription-factor-binding sites because of an increased signal-to-noise ratio, as demonstrated with two sets of promoter sequences. The method is implemented as a graphical web application, ConSite, which is at the disposal of the scientific community at . CONCLUSIONS: Phylogenetic footprinting dramatically improves the predictive selectivity of bioinformatic approaches to the analysis of promoter sequences. ConSite delivers unparalleled performance using a novel database of high-quality binding models for metazoan transcription factors. With a dynamic interface, this bioinformatics tool provides broad access to promoter analysis with phylogenetic footprinting

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Compensating for literature annotation bias when predicting novel drug-disease relationships through Medical Subject Heading Over-representation Profile (MeSHOP) similarity

Author: BF Francis Ouellette
Warren A Cheung
Wyeth W Wasserman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Crossref

SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions

Author: Byron Yu-Lin Kuo
Elizabeth M Simpson
Slavita Bohacec
Susan Baxter
Wyeth W Wasserman
Ying Chen
Öjvind Johansson
Publication venue: Public Library of Science
Publication date: 01/01/2006
Field of study

Serial analysis of gene expression (SAGE) not only is a method for profiling the global expression of genes, but also offers the opportunity for the discovery of novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript nor to the genome were hypothesized to span a splice junction, for which the exon combination or exon(s) are unknown. To test this hypothesis, we have developed an algorithm, SAGE2Splice, to efficiently map SAGE tags to potential splice junctions in a genome. The algorithm consists of three search levels. A scoring scheme was designed based on position weight matrices to assess the quality of candidates. Using optimized parameters for SAGE2Splice analysis and two sets of SAGE data, candidate junctions were discovered for 5%–6% of unmapped tags. Candidates were classified into three categories, reflecting the previous annotations of the putative splice junctions. Analysis of predicted tags extracted from EST sequences demonstrated that candidate junctions having the splice junction located closer to the center of the tags are more reliable. Nine of these 12 candidates were validated by RT-PCR and sequencing, and among these, four revealed previously uncharacterized exons. Thus, SAGE2Splice provides a new functionality for the identification of novel transcripts and exons. SAGE2Splice is available online at http://www.cisreg.ca

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A new generation of JASPAR, the open-access repository for transcription factor binding site profiles

Author: De Bleser Pieter J.
Lenhard Boris
Sandelin Albin
van Roy Frans
Vleminckx Kris
Vlieghe Dominique
Wasserman Wyeth W.
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

JASPAR is the most complete open-access collection of transcription factor binding site (TFBS) matrices. In this new release, JASPAR grows into a meta-database of collections of TFBS models derived by diverse approaches. We present JASPAR CORE—an expanded version of the original, non-redundant collection of annotated, high-quality matrix-based transcription factor binding profiles, JASPAR FAM—a collection of familial TFBS models and JASPAR phyloFACTS—a set of matrices computationally derived from statistically overrepresented, evolutionarily conserved regulatory region motifs from mammalian genomes. JASPAR phyloFACTS serves as a non-redundant extension to JASPAR CORE, enhancing the overall breadth of JASPAR for promoter sequence analysis. The new release of JASPAR is available at

Crossref

Ghent University Academic Bibliography

PubMed Central

oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes

Author: Arenillas David J.
Brumm Jochen
Ho Sui Shannan J.
Kennedy Brian P.
Mortimer James R.
Walsh Christopher J.
Wasserman Wyeth W.
Publication venue: Oxford University Press
Publication date: 02/06/2005
Field of study

Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes

Crossref

PubMed Central

In Silico Detection of Sequence Variations Modifying Transcriptional Regulation

Author: Boris Lenhard
David Arenillas
Gary Stormo
Jacob Odeberg
Malin C Andersen
Per Eriksson
Pär G Engström
Stuart Lithwick
Wyeth W Wasserman
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation

CiteSeerX

Public Library of Science (PLOS)

University of Bergen

Crossref

Directory of Open Access Journals

PubMed Central

NORA - Norwegian Open Research Archives

JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Author: Arenillas David J
Chen Chih-Yu
Denay Grégoire
Fornes Oriol
Lee Jessica
Lenhard Boris
Mathelier Anthony
Parcy François
Sandelin Albin
Shi Wenqiang
Shyr Casper
Tan Ge
Wasserman Wyeth W.
Worsley-Hunt Rebecca
Zhang Allen W
Publication venue: 'Oxford University Press (OUP)'
Publication date: 04/01/2016
Field of study

International audienceJASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release

Hal - Université Grenoble Alpes